Lindstedt series and Kolmogorov theorem. 
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Giovanni Gallavotti* 
Abstract: the KAM theorem from a combinatorial viewpoint. 

Lindstedt, Newcomb and Poincare introduced a remarkable trigonometric series motivated by the 
analysis of the three body problem, [P]. In modern language, [G2], it is the generating function, that I 
call Lindstedt series here, h(ip) = £ k hP^ (tp) of the sequence (tp) of trigonometric polynomials 

associated with well known combinatorial objects, namely rooted trees. 

It is necessary to recall, first, the notion of rooted tree as used here. We lay down one after the other 
on a plane k pairwise distinct unit segments oriented from one endpoint to the other (respectively the 
initial point and the endpoint of the oriented segment also called arrow or branch) . 
The rule is that after laying down the first segment, the root branch, with the endpoint at the origin 
and otherwise arbitrarily, the others are laid down one after the other by attaching an endpoint of 
a new branch to an initial point of an old one and leaving free the branch initial point. The set of 
£■ — ' initial points of the object thus constructed will be called the set of the tree nodes. A tree is therefore 

a partially ordered set with top point the endpoint of the root branch, also called the root (which is 
not a node). 

The angles at which the segments are attached will be irrelevant: i.e. the operation of changing 
the angles between arrows emerging from the same node (each arrow carrying along, unchanged, the 
0^ ■ subtree of arrows possibly attached to its initial point) generates a group of transformations and two 

trees that can be overlapped by acting on them with a group element are regarded as identical. The 
£>y number of trees with k branches is thus bounded by 4 k k\. 

With each tree node v we associate an incoming momentum, or " decoration" , which is simply an 
q integer component vector v v ; with the root of the tree (which is not regarded as a node) we associate 

| a label j = 1, . . . , I. 

■ With each branch A = v'v, with final point v' and initial point v, we associate another integer 

component vector, the branch momentum "flowing through the branch", defined by v(v) — Ylw<v ^ w 
(we shall also denote v(v) by the symbol v(X))- Then, given a positive matrix J and a trigonometric 
polynomial f(tp) — J2o<\p\<n fv^osv ■ tp, fv = f-a, we consider from now on only decorated trees i? 
with k branches, such that v(y) ^ for all v, and associate with each decorated tree the value: 

v<r v v 

where v' is the node immediately following v in uj ■ v(v) will be called the divisor of v'v; here V r 
denotes the unit vector in the j'-th direction e*j, j — 1, . . . , I. The momentum flowing through the root 
will be denoted also . The Lindstedt-Newcomb-Poincare ("LNP") polynomial Jv- k \ij)) is defined 
byE^fe* " 
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■d,v{d)=v 

The main result is, if \lj ■ 1 < C\v\ T for some C, t > 0: 
Theorem: The Lindstedt series is convergent for e small enough. 

Remark: The first proof (of a more general result) is due to Kolmogorov, [K]; a conceptually new 
proof came much later and is due to Eliasson, [E], quite difficult to read but definitely very nice and 
correct. Further proofs were given by [G2] (extracted from [Gl]) and by [CF1]: somewhat different 
from the proof in [E] (and remarkably close to each other (except for the size), although independent). 
The main contribution of the latter papers was to realize that the proof could be greatly simplified 
by restricting it to a special case. The above case is the simplest and it is discussed in [G1],[G2]: 
even mild generalizations of the simplest case, [CF1], tend to become quite involved particularly if 
inappropriate notations are used: a comparative analysis of the different proofs will appear in [GM4]. 

proof: Let us call a branch v'v of a tree $ a scale n branch, if 2™ _1 < C\v(v) ■ uj\ < 2™ where 
n = 0, —1, —2, ... or a scale 1 branch if C \Q ■ v(v)\ > 1. 

We call a cluster of scale n in the tree i? a maximal connected set of branches of scale m > n. 

Notable clusters consist of the clusters V with only one entering branch Ay (and, of course, only one 
exiting branch X' v ). If ny is the scale of the cluster then the scale n\ v of the branch Ay is necessarily 
n\ v < n v . If v e V call v°(v) = J2wev,w<v V w 

A cluster V of the above type is called a resonance or a resonant cluster if t?(Ay) = v{\' v ) and also 
the number AT of branches inside the cluster is not too large: Af < ^2-( ni v+ 3 )/ T ; hence it is so small 
that \C uj ■ v°(v)\ > 2" A v +3 . A resonant cluster may contain resonant subclusters: we shall denote V 
the set of nodes in V which are outside the possible resonant subclusters of V, and we call them the 
free nodes of V. 

A resonance has two scales attached to it: the scale ny of the resonant cluster and the scale n\ v of 
the branch entering the resonance. We shall distinguish them by calling them the inner scale and, 
respectively, the outer scale of the resonance: it is of course ny > n\ v . But the latter inequality 
can be made stronger: in fact the previous paragraph shows that the scale n\ of a branch A in V is 
determined by the size of the "divisor" C (uj ■ z?°(A) + e) with C\e\ < 2™ A v , so that the size of the 
divisor is > 2" A v+ 3 - 2™ A v > 2™ A v+ 2 , hence: 

n\ > n v > n Xv + 3 (3) 
for all A in a resonance V with outer scale n\ v and inner scale ny. 

Given a branch with endpoint on a node v of a tree "& we define the operation of attaching or shifting 
the branch to another node w. it consists in detaching the whole subtree preceding v (which has v as 
root) and of attaching it to another node w: the result is a new tree 

One then considers all the trees d obtained from a given tree # by one of the following operations: 

(i) detaching the endpoint of a branch Ay entering a resonance V and attaching it (in the above sense) 
to all possible free nodes v G V: an act that will be called the shift operation on V. 

(ii) changing the sign of all (simoultaneously) the incoming momenta V v of the free nodes v e V: the 
parity operation on V . 

This produces a family J-(i3 ) of trees containing not too many trees: at most FJy 2my if my is the 
number of free nodes inside a resonance V. Since J2v m v < k the number of elements of .F(i?o) does 
not exceed JJ V 2my < 2 2fc , (using m < 2 m ). 

Furthermore a condition will be imposed on lj (see below) such that given two trees do and $i either 
^i^o) = -^(^l) or -^(^o) l~l ^"(^i) = 0- This can be done for the following reason. 



2 



Consider a branch Ay with scale n\ v entering a resonant cluster V with inner scale riy > n\ v and 
attached to a node v\ in V; then if Ay is shifted to be attached to a node vi € V the momenta flowing 
through the branches inside V may change by an amount j?(Ay): therefore their scale may change by 
at most 2™ A v . 

If the resonance V is inside many resonances V2, V3, . . . with outer scales n 2 > n 3 > .. (note that 
n\ v > n 2 ) then by shifting the branches Ay entering Vi in all possible ways to be attached at different 
free nodes inside Vi the scale of the branches in V cannot change by more than 2™ A v + 2" 2 + 2™ 3 + . . . < 

Since the number j\f < -I,2~( ,lA v+ 3 )/ T of branches inside the resonance V is not too large the vector 
v°(v) = J2 W £Vw<v^w is not longer than 2~( nx v+ z )/ T for any v G V. Hence assuming that the 
following property holds for ul: 

min \C\uo-u\ - 2 P | > 2 n+1 if n < 0, |j7| < 2-( n+3 >/ r (4) 

n<p<0 

we see that the number x v = Cuj ■ v°(v) is not only > 2™ A v +3 , but also it is further away from the 
extremes of the interval I p = (2 P_1 ,2 P ] or, for p = 1, I\ = (l,+oo) containing x v , by an amount 
2™ A v +1 at least. 

This means that by shifting the branches entering the resonances external to a resonant cluster V 
of outer scale n\ v one cannot change the scales of the branches inside V (because for v € V it is 
determined by the size of Cuj • v°(v) plus a quantity which is at most 2™ A v +1 , while the distance of 
Cuj ■ j7°(w) from the extremes of the intervals I p is greater). 

The scales of corresponding branches in the elements of a family ,F(t9o) do not change by shifting 
operations nor they do change by the parity operations, provided Eq. (4) holds: in particular 2" A_1 < 
\Cuj ■ v°(X)\ < 2™ A , for any A in V. Here two branches of two trees in ^"(i?o) are "corresponding" if 
they can be superposed by performing suitable sequences of shift or parity transformations in (i) , (ii) 
above. The part of the statement concerning the parity is easily checked, too. 

Therefore, assuming Eq. (4) for the time being (note, however, that almost all w's verify Eq. (4)) we 
shall use the following representation of the function h^(ip): 

Lindstedt series representation: 

(a) consider all the trees with k branches and collect them into disjoint families: each family is 
obtained from a tree d in it by the operation of shifting the branches entering a resonance V to the 
various free nodes in V , or of reversing simoultaneously the signs of the momenta v v incoming into 
the free nodes of V , for all resonances V , independently. 

(b) define the value of a family to be the sum of the values of the trees in it. 

(c) sum all the values of the families. 

This is a natural representation for h^: it was introduced in [G1],[G2] and with a minor variation, 
based on the slightly different notion of resonance used by [E], in [CFI] (independently). 

The above representation seems also close to the one proposed in [GL]. In fact it is probably very 
natural if one recalls that there is a symplectic structure in the problem that generates the Lindstedt 
series, see comments (7,8,9) below. Evidence in this direction is provided by the alternative scheme 
of proof followed by [E], where the bound on the convergence radius of the series is deduced from 
an argument in some respect simpler (but abstract) than the one below, where the cancellations 
happening when one sums all the tree values of trees in a given family are checked "indirectly" as a 
consequence of the symplectic symmetry of the mechanical system whose theory leads to the series, 
through an idea that goes back to [P], see [GM4]. 

The proof will now proceed to the easy check that the value of each family is bounded by B k for 
some B > 0: since there are at most as many families as trees (i.e. < k\4 k ) this implies the theorem. 

Fix a family ^"(i?) of trees. And consider the set of the innermost, or first order, resonances Vi. Then 
we shall call ey = Cuj ■ v(\v) where V G Vi is a resonant cluster with incoming momentum v(\v) 
and we shall sum over the trees obtained from 1? by the shift and parity operations on the resonances 



3 



in Vi: if Im^— J2veVi ^ m v the number of trees involved in the sum is Jl^eVi ^ m v < 2 2mi , (by 
2m < 2 2rn ). 

The "value J\($) of the subfamily" of CF{"&) so obtained can be regarded as a function of the complex 
parameters {ey}, with V varying in V\ defined as follows. Note that the momentum flowing in a 
branch A = v'v C V is Co ■ v°{v) + a v u) ■ v{\y) with a v = 0, 1; then the function is constructed by 
replacing the quantities (Co ■ v°(v) + <t v uj ■ V(\y))~ 2 by (Co ■ j7°(v)A + a v ey)~ 2 for all the branches 
v'v C V. 

Note that this implies that in regarding the value of .Fi(#) as a function of the ey's the divisors 
associated with the branches entering or exiting the resonance V are not represented as e v 2 . 

It is now easy to see that the value of a family is holomorphic in the variables ey in the disks 
\sy\ < 2 nv ~ 3 . This is because in any tree of the family it is C Co ■ v°(v)\ > 2 nv ^ 1 (by the remarks 
following Eq. (4)) so that for \e v \ < 2 nv ~ 3 it is \Clj ■ v°(v) + ae v \ > 2 riv ~ 3 . In fact, for later use, one 
sees that the last inequality holds even in the larger disk \sy\ < 2 nv ~ 3 (l + j) (the ^ will arise later 
as the sum of the geometric series with ratio g). 

This means that the value Ji({ev}veVi) of the subfamily Ti{"d) can be bounded above, for \ey\ < 
2«v-3 ^ anc j j n £ ac ^ even £ or | £v | < 2 nv ~ 3 ^)> by trivially bounding the divisors in the contributions of 
the various trees in terms of their scales, or by using \C u ■ v°(v) + crey| > 2 nv ~ 3 . One finds therefore, 
from Eq. (1), the bound: 

22m 1 ^Y d -g/o 2m i \ ^ fo^N 2 ^ j-j2-2(n A -3) _ o2mi^ /o^Y J~J 2~ 2nN ^ (5) 

k. Jo ^ Jo n 

if N n is the number of branches with scale n, /o = max \ fp\ and Jo is a lower bound to J. 
It is easy to check that the function Fi({ev}) vanishes in each sy to second order for ev = 0. This is 
simply because the sum of the momenta V Vj incoming into the nodes Vj E V, J2j Vv } , m each resonance 
V, vanishes, (by definition of resonance). Furthermore the only variation of the value of a tree when 
the branch entering a resonance is shifted to the node vj from the node vy is the replacement of a 
factor V v ., ■ J _1 i?', for some V, with the factor v Vj ■ J _1 j7', when ey — 0. Hence if, ey — for some V, 

Fi({sv}) is proportional to J2 v ev — and vanishes to first order in each of the ev- The summation 
over the sign reversals turns the function Fi({ey}) into an even function of the £y, hence vanishing 
to second order. 

Therefore the above bound Eq. (5) in the domain \ey \ < 2 nv ~ 3 implies, by the maximum principle, 
that the bound can be improved by a factor JlyeVi i -^v- a ) ' which becomes, if one replaces ey by 
its actual value: 

JJ 2 2(n, v -„v+3) jf n Xv < Uy - 3 (6) 

Note that only the case n\ v < ny — 3 needs the maximum priciple. 

We now consider the second order resonances, i.e. the set of resonances V2 which only contain first 
order resonances. Then we denote again ey = Clu ■ v{\y) where V G V2 is a resonant cluster with 
incoming momentum i>(\y) and we shall set 2m,2 = XVev 2 ^my, if my is the number of free nodes 
in V. 

Summing over the < 2 2mi+2rri2 trees obtained from 1? by the shift and parity operations on the 
resonances in Vi,V2 we define the "value ^2(1?) of the subfamily" of J-2{°&) so obtained. It can be 
regarded as a function of the complex parameters {ey}, with V varying in V2. The momentum flowing 
in a branch A = v'v C V is Co ■ v°(v) + a v uj ■ v{\y) with a v — 0, 1 and the function is constructed, as 
in the previous case, by replacing the quantities (Co ■ v®{v) + a v u) ■ v(\y))~ 2 by (uj ■ P°(w)A + cr^ey) -2 
for all v'v C V. 

Furthermore when the variable £y varies in the disk \sy \ < 2 nv ~ 3 the divisors of the branches entering 
the first order resonances W C V have the form Co ■ +<r v uj ■ u{\y) with 2" A_1 < C\Cd-ifi(v)\ < 2" A 
and therefore if n v < n w - 3 they are bounded above by 2 nv + 2 nv - 3 < 2 nw - A + 2 nw - 6 < 2 nw ~ 3 . 
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Hence if n v < n w - 3 the value of the function is bounded by 2 2m2+2mi Y 1 X, for \e v \ < 2 nv ~ 3 : this 
remains true even if ny — nw — 3 for one or more W C V. In the latter cases, in fact, the momentum 
entering the resonance W needs not be < 2 nw ~ 3 but it is < 2 nv +2 nv - 3 = 2 nw - 3 +2 nw - & < 2" v - 3 f so 
that the branches A inside W are still such that C\iJ-u°(X)+a v e w \ > 2 nw - 1 -2 nw - 3 -2 nw ~ 6 > 2 nw - 3 
and taking Y\ = 1 provides a correct bound. 

Again the value F 2 (??) vanishes to second order in the variables Sv so that it can be bounded, if one 
replaces ey by its actual value, by 2 m2+mi Y{Y 2 X where Y<i is defined as in Eq. (6) with V 2 replacing Vi, 

now considers the third order resonances and so on until one finds the 
bound on the value of the complete family T(-d) given by 2 2mi+2m2+ - Y X Y 2 . . . X. Since 2m 1 + ... < 2k 
one finds, recalling the definition of X in Eq. (5): 

1 f 96/^2 , m T 

i.(Z2±i_) fe ]j2- 2 " w " [] Y[ J]2 2 ("-™^+ 3 ) (7) 

n n<0T,n T =ni=l 

if T are the clusters of the family of trees under consideration, wit is the number of maximal resonances 
contained inside the cluster T, and rij denotes the scale of the branch entering the i-th of such 
resonances. 

The number N n of branches of scale n can be bounded by imitating the key bound due to Siegel and 
Brjuno in the case of trees without resonances (the bound seems to have been well known since Siegel's 
theorem for the similar case of the linearization of the complex maps, which essentially corresponds 
to the analysis of the generating function of polynomials like the ones considered here and generated 
by resonanceless trees). 

If nix is the number of maximal resonances inside the cluster T then N n is bounded by: 

N n < AN2 {n+3 ^ T k + J2 ( m T- 1) ( 8 ) 

T,n T =n 

the proof in [G2] is reproduced, for completeness, in the appendix. 

Hence the product in Eq. (7) is bounded by combining Eq. (7) and Eq. (8), after some simple power 
counting, by: 

1 f f 2 12 C 2 N 2 ^ k -A- 9 _ 8 „jv2<"+ 3 >/-fc 1 R k /q\ 

*i( — Jo — } W 2 = k\ B ° (9) 

n— — 00 

with Bo = C a / J - 1 JV 2 2 12 2-S» 8nJV2(B+8)/T . 

The number of families cannot exceed the number of trees, i.e. k\2 2k , so that the series Eq. (1) 
converges for |e| < (4B ) _1 , which is even a rather good estimate, at least as a first estimate, for N 
small. 

This completes the proof in the case in which the inequalities Eq. (4) hold. Such inequalities were 
called strong diophantine property in [Gl], [G2]. If one examines carefully their role (i.e. keeping the 
scales unchanged while performing the operations (i), (ii)) one sees that it is very reasonable that 
they can be replaced by a suitable similar property which holds in the general case in which only the 
diophantine inequality (following Eq. (2) above) holds: this is discussed at the end of [G2] and proved 
in [GG]. 

Comments: 

(1) The restriction that / is a trigonometric polynomial can be lifted easily to treat analytic /'s, as 
shown in [GM2], (in [CF1] too the / is not supposed a polynomial). The restriction that / is even can 
be lifted simply by replacing the parity operation by the new operation of shifting the node to which 
the outgoing branches are attached in V, as shown in [CF1]. Note that the even case is extremely 
interesting for the applications as the perturbation functions /, in many cases, are even because this 
property is related to the time reversal symmetry (see [BCG]). 
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(2) The above method can be extended to study the generating functions of quantites originated in 
a way analogous to the Lindstedt polynomials in the perturbation theory of whiskered tori, see [Gl]: 
the first such results are in [Gel], [Ge2]. In the later review paper [CF2], aware of such references, a 
part of such results are also discussed and apparently claimed as new, if so improperly because the 
analysis in [Gel],[Gc2] is correct and complete (and, furthermore, it goes quite far beyond what is 
discussed in [CF2] in the cases common to both papers). 

(3) The method of [E] is not quite the same as the above, see [GM4]. 

(4) The above method reminds very much of field theory methods: the similarity was noted first in 
[FT], at a formal level, without taking advantage of it to study a proof of the theorem. The results of 
[GM1], [GM2] take full advantage of the similarity and build a proof (see comment (9) below). 

(5) In [G3] it is shown that there is a euclidean field theory whose one point Schwinger function is 
the function Eq. (2). This analogy is pushed further in [GGM], where a connection with the theory 
of the singularity in e of the function Eq. (1) as e grows is heuristically attempted, along an early 
suggestion in [PV]. 

(6) the convergence of the Lindstedt series generated by the general KAM theorem can be proved 
along the above lines, and as one should expect, it does not involve any new ideas, in fact see [CF2] 
for a somewhat more detailed sketch and [GM3] for a proof. 

(7) there are other ways besides the one of [E] to reach a representation similar to the above and 
giving immediately an absolutely convergent series for the generating function of the LNP polynomial. 
This has been shown in [GL] and it seems closely related to the fact that the problem is associated with 
the theory of hamiltonian evolutions. Also the method of [E] does not perform explicit cancellations 
on individual terms of the series, but it shows that they must happen because of the symplectic nature 
of the equations that the convergence of the LNP series would solve; hence it would be interesting to 
establish a clear connection between the method in [GL] and that of [E] : this may help establishing a 
more clear connection also between the above method and [G1],[G2] or [CF1]. 

(8) one can ask whether the above proof can be used for numerical purposes. At first sight one may 
think that the maximum principle used in the proof is an obstacle. However one should note that the 
sum over the tree values in a given family is an algebraic operation: hence the analytic estimate can 
be replaced by an algebraic one (this is exploited in [GM1], [GM2]). 

When one performs the sum corresponding to the shift of the incoming branch, or to the parity 
transformation, for a resonance V, one takes a linear combination of my — 1 products of divisors. 
Therefore the result is a polynomial in sy. The above remarks show that it starts at second order: 
hence one can factor e v (by a finite number of algebraic operations) which cancels one of the two 
divisors associated with the resonance incoming and outgoing branches. 

This means that each resonance only contributes a divisor not exceeding e v 2 2~ 2nv < 2~ 2nx v~ 2nv , in 
the estimates, instead of £y A < 2~ inx v : in other words the "extra" divisor on scale n\ v is replaced 
by a divisor on the generally much higher scale ny (hence much larger). It is not difficult to see that 
the phenomenon of accumulation of divisors expressed by the sum in Eq. (8) is precisely due to the 
fact that without taking into account the above remark the resonances contribute two divisors on the 
same scale (i.e. they generate a divisor £ V A ). Hence once the above algebraic operation is performed 
one has no more divisors in excess, in comparison with the Siegel's situation: this idea could also be 
used to provide a quick proof of Eq. (8) if one assumes it to be valid for resonanceless trees, a case in 
which it was well known being implicit in Siegel's and Brjuno's work. 

The remarkable and important result in [GL], that justifies the apparent absence of a cancellation 
analysis, is that the above collection of terms is automatically performed if one proceeds by keeping 
always explicitly into account the symplectic structure of the problem that is solved by the Lindstedt 
series. In this respect the work [GL] is much closer to [E] than [G1],[G2],[CF1]. 

(9) of course one could define the notion of resonant cluster by forgetting the condition on the 
number of branches: this would lead, if suitably complemented with a redefinition of the families of 
trees, to an overcompensation and to worse final estimates (and to a trivially more involved proof). 
This is done in [E], and in some way also in [GL] (but not in [G1],[G2],[CF1]). On the other hand 
it can provide a conceptual simplification as it has been shown in [GM2] , [GM3] where the possibility 
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of overcompensating is even exploited to use deeply the analogies of the above ideas with those of 
renormalization theory in quantum field theory (see (4) above) . The result is what I think is the most 
transparent and elementary proof of the KAM theorem, even simpler than the one decribed in the 
present paper (although the latter is likely to be better for numerical applications). 

(10) finally it is important to note that the results depend only on the smallest eigenvalue Jo of the 
matrix J and not on the maximum. Given the hamiltonian H = Q ■ A + ^AJ~ 1 A + £<p(a) where 
A e R l and a is in the torus T l (Thirring model, see [G2]), the Lindstedt series solves, see [G2], the 
problem of the existence of a KAM torus close to A = and with rotation vector lu: hence we see 
that the condition on the size of s is independent on the twist rate, which is the maximum eigenvalue 
of J. In the latter model the twist condition, that plays such an important role in the general KAM 
theorem, is not necessary: thus tori arising from the above analysis were called, in [G1],[G2], twistless 
KAM tori. 

Appendix Al: Resonant Siegel-Brjuno bound. 

Calling N* the number of branches of scale < n which are not entering a resonance (" non resonant 
branches"). We shall prove first that N* < 2fc(A2-<"+ 3 )/ r )- 1 - 1 if N n > 0. We fix n and denote N* 
as iV*(i9). Let e = \ and E = N~ 1 2~ 3s . 

If the root branck scale of i? is > n and if i?2, • • • , $m are the subtrees of z? attached to the root 
branch of i? and with k 3 > E 2~ £n branches, it is N*{$) = A*(i?i) + . . . + N*(-d m ) and the statement 
is inductively implied from its validity for k' < k provided it is true that N*("&) = if k < E2~ £n , 
which is is certainly the case if E is chosen as above. 1 

In the other case it is N* < 1 + Y^lLi and if m = the statement is trivial, or if m > 2 the 

statement is again inductively implied by its validity for k' < k. 

If m = 1 we once more have a trivial case unless the order k\ of $i is k\ > k — ^E 2~ n£ . Finally, and 
this is the real problem as the analysis of a few examples shows, we claim that in the latter case the 
root line of i?i is either a resonant line or it has scale > n. 

Accepting the last statement it will be: N*(&) = 1 + N*(di) = 1 + AT*(t?i) + . . . + N*(&' m ,), with 
fl'j being the m' subdiagrams emerging from the first node of with orders k'j > E2~ £n : this is 
so because the root line of $1 will not contribute its unit to iV*(i?i). Going once more through the 
analysis the only non trivial case is if m' = 1 and in that case A*(z9' 1 ) = AT*(i9"i) + . . . + A*(t9" m »), 
etc, until we reach a trivial case or a diagram of order < k — jE2~ ne . 

It remains to check that if k\ > k — ^E2^ n£ then the root branch of #i has scale > n, unless it is 
entering a resonance. 

Suppose that the root line of #i has scale < n and is not entering a resonance. Note that \u ■ 
v(vq)\ < 2™, \u> ■ v(v\)\ < 2™, if Vo,v\ are the first vertices of $ and $i respectively. Hence S = \(uj ■ 
(v(vo) — u(v\))\ < 2 2™ and the diophantine assumption implies that \v{vo) — z?(i>i) > (2 2™)~ T , or 
v{vq) = v{vi). The latter case being discarded as k — ki < ^E2~ n£ (and we are not considering the 
resonances: note also that in such case the lines in different from the root of $ must be inside 

a cluster, see footnote 3)), it follows that k — ki < ^E2^ n£ is inconsistent: it would in fact imply 
that u(vo) — v(vi) is a sum of k — k\ vertex modes and therefore \P(vo) — v{v{)\ < \NE2~ n£ hence 
S > 2 3 2 n which is contradictory with the above opposite inequality. 

The total number of branches with scale n is therefore N* + n T =n mT ^ mT 1S * ne num ber of 
resonances contained in T. A similar, far easier, induction can be used to prove that if N* > then 
the number p of clusters of scale n verifies the bound p < 2k (E2~ £n )~ 1 — 1; it will be left out (see 
[GG]). Thus (8) is proved. 

Remark: the above argument is a minor adaptation of Brjuno's proof of Siegel's theorem, as remark- 
ably exposed by Poschel, [P6]. 



1 Note that if k < £2~ ne it is, for all momenta v of the branches, \V\ < NE2~ ne , i.e. \<3 ■ V\ > (NE2-" E )- T = 2 3 2 
so that there are no clusters T with inner scale nj = n and N* = 0. 
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